Individualized Error Estimation for Classification and Regression Models

نویسندگان

  • Krisztian Buza
  • Alexandros Nanopoulos
  • Lars Schmidt-Thieme
چکیده

Estimating the error of classification and regression models is one of the most crucial tasks in machine learning. While the global error is capable to measure the quality of a model, local error estimates are even more interesting: on the one hand they contribute to better understanding of prediction models (where does and where does not work the model well), on the other hand they may provide powerful means to build successful ensembles that select for each region the most appropriate model(s). In this paper we introduce an extremely localized error estimation, called individualized error estimation (IEE), that estimates the error of a prediction model M for each instance x individually. To solve the problem of individualized error estimation, we apply a meta model M. We systematically investigate various combinations of elementary models M and meta models M on publicly available real-world data sets. Further, we illustrate the power of IEE in the context of time series classification: on 35 publicly available real-world time series data sets, we show that IEE is capable to enhance state-of-the art time series classification methods.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Estimation of Required Rotational Torque to Operate Horizontal Directional Drilling Using Rock Engineering Systems

Horizontal directional drilling (HDD) is widely used in soil and rock engineering. In a variety of conditions, it is necessary to estimate the torque required for performing the reaming operation. Nevertheless, there is not presently a convenient method to accomplish this task. In this paper, to overcome this difficulty based on the basic concepts of rock engineering systems (RES), a model for ...

متن کامل

تعیین مناسب‌ترین روش برآورد رسوب معلق بر اساس آماره‌های خطاسنجی (مطالعه موردی-تعدادی از زیرحوزه‌های کشف‌رود)

The phenomena of erosion, sediment transport and sedimentations have tremendously destructive effects on environment and hydraulics structures. In general, sediment transportation depends on river discharges, but the proposed equations inherited with large errors. To evaluate the suspended sediment loads and an optimized model on them, in this research, data were collected from some sub-watersh...

متن کامل

Derivation of regression models for pan evaporation estimation

Evaporation is an essential component of hydrological cycle. Several meteorologicalfactors play role in the amount of pan evaporation. These factors are often related to eachother. In this study, a multiple linear regression (MLR) in conjunction with PrincipalComponent Analysis (PCA) was used for modeling of pan evaporation. After thestandardization of the variables, independent components were...

متن کامل

Experimental Evaluation of Algorithmic Effort Estimation Models using Projects Clustering

One of the most important aspects of software project management is the estimation of cost and time required for running information system. Therefore, software managers try to carry estimation based on behavior, properties, and project restrictions. Software cost estimation refers to the process of development requirement prediction of software system. Various kinds of effort estimation patter...

متن کامل

Solar Radiation Estimation from Rainfall and Temperature Data in Arid and Semi-arid Climates of Iran

Precipitation and air temperature data, only, are often recorded at meteorological stations, with radiation beingmeasured at very few weather stations, especially in developing countries. Therefore there arises a need for suitablemodels to estimate solar radiation for a completion of data sets. This paper is about an evaluation of eight models foran estimation of daily solar radiation (Q) from ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010